Using machine learning method and subword unit representations for spoken document categorization

Authors

Weidong Qu

Katsuhiko Shirai

Abstract

In this paper, we investigate the feasibility of using a machine learning method and subword units for spoken document categorization, as an alternative to using words generated by word recognition or keyword spotting. An advantage of subword acoustic unit representations for spoken document categorization is that they require no prior knowledge of the contents of the spoken document and can address the out-of-vocabulary (OOV) problem. The context-sensitive learning method is efficient on large, noisy corpora and well suited to subword-based categorization. Because even the best phone recognizers make a large number of mistakes, phone lattices can be used to obtain the bag of phone N-grams for each speech document, which improves phone N-gram recall. In this study, we examine a variety of subword unit categorization terms and measure how effectively they support categorization; we also investigate performance when the underlying phonetic transcriptions contain different recognition errors.
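As an illustration of the bag-of-phone-N-grams idea, the following is a minimal sketch and not the authors' implementation. It assumes a phone lattice can be approximated by a short list of alternative phone sequences; a real lattice is a graph with scores on its arcs. The function names `phone_ngrams` and `bag_of_phone_ngrams` and the example phone labels are hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Sequence, Tuple


def phone_ngrams(phones: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return all contiguous phone N-grams of length n, in order."""
    return [tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)]


def bag_of_phone_ngrams(hypotheses: Iterable[Sequence[str]], n: int) -> Counter:
    """Merge the N-gram bags of several alternative phone sequences.

    Assumption: the lattice is stood in for by a list of alternative
    paths. An N-gram that appears in any path enters the bag (this is
    what raises recall); its count is the maximum count over the paths.
    """
    bag: Counter = Counter()
    for phones in hypotheses:
        bag |= Counter(phone_ngrams(phones, n))  # Counter union keeps the max count
    return bag


# Two alternative recognitions of the same utterance (hypothetical labels).
one_best = ["k", "ae", "t"]
alternative = ["k", "eh", "t"]
bag = bag_of_phone_ngrams([one_best, alternative], n=2)
```

With only the 1-best sequence, the bag holds two bigrams; adding the alternative path brings in the bigrams of the competing hypothesis as well, so a term missed by the top hypothesis can still be matched by the classifier.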

Similar resources

Multi-scale-audio indexing for translingual spoken document retrieval

MEI (Mandarin-English Information) is an English-Chinese crosslingual spoken document retrieval (CL-SDR) system developed during the Johns Hopkins University Summer Workshop 2000. We integrate speech recognition, machine translation, and information retrieval technologies to perform CL-SDR. MEI advocates a multi-scale paradigm, where both Chinese words and subwords (characters and syllables) ar...

An Investigation of Subword Unit Representations for Spoken Document Retrieval

This study investigates the feasibility of using subword unit representations for spoken document retrieval as an alternative to using words generated by either keyword spotting or word recognition. Our investigation is motivated by the observation that word-based retrieval approaches face the problem of either having to know the keywords to search for a priori, or requiring a very large recogn...

Subword unit representations for spoken document retrieval

This paper investigates the feasibility of using subword unit representations for spoken document retrieval as an alternative to using words generated by either keyword spotting or word recognition. Our investigation is motivated by the observation that word-based retrieval approaches face the problem of either having to know the keywords to search for a priori, or requiring a very large recogn...

Phonetic recognition for spoken document retrieval

This paper describes the development and application of a phonetic recognition system to the task of spoken document retrieval. The recognizer is used to generate phonetic transcriptions of the speech messages which are then processed to produce subword unit representations for indexing and retrieval. Subword units are used as an alternative to words units generated by either keyword spotting o...

Multilayer subword units for open-vocabulary spoken document retrieval

This paper describes the application of subword units in an effort of improving open-vocabulary spoken document retrieval performance in the case of highly corrupted recognition output. This paper presents the developed open-vocabulary spoken document retrieval system including the newly proposed subphonetic segment unit and combining multilayer subword units. Our experiments on Japanese spoken...

Publication date: 2000
